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1 Introduction 



This paper presents an approach to integration of normal logic programs 
under the well-founded semantics with external first-order theories. The 
problem is motivated by the discussion about the rule level of the Semantic 
Web. 

It is often claimed that rule-based applications need non-monotonic rea- 
soning for handling negative information. As the issue of non-monotonic 
reasoning and negation was thoroughly investigated in the context of logic 
programming (see e.g. |AB94j for a survey of classical work on this topic), 
it would be desirable to build-up on this expertise. Well-established for- 
mal semantics: the answer set semantics [BG94] , the well-founded semantics 
|vGRS88j . and the 3- valued completion semantics of Kunen |Kun87j provide 
theoretical foundations for existing logic programming systems. On the other 
hand, applications refer usually to domain-specific knowledge, that is often 
supported by specific reasoning or computational mechanisms. 

Domain-specific variants of logic programs are handled within the con- 
straint logic programming framework CLP [MSW06J. In CLP the concept 
of constraint domain makes it possible to extend the semantics of pure logic 
programs and to use domain-specific constraint solvers for sound reason- 
ing. Classical CLP does not support non-monotonic reasoning, but inte- 
gration of both paradigms is discussed by some researchers (see e.g. |Stu95l 
DS98, Fag97|). Special kind of domain-specific knowledge are domain-specific 
terminologies, specified in a formal ontology description language, such as 
OWL [PSHH04J. This raises the issue of integration of rules and ontologies, 
which has achieved a considerable attention (sec e.g. [Ros05, MSS05, EIST06, 



ADG + 05l IHBET08, MR07J and references therein). It is commonly assumed 



that an ontology is specified as a set of axioms in (a subset of) First Order 
Logic (FOL), usually in a Description Logic. A rule language, typically a 
variant of Datalog, is then extended by allowing a restricted use of ontology 
predicates. The extensions considered in the literature are mostly based on 
disjunctive Datalog with negation under the answer set semantics. 

In contrast to that, our focus is on normal logic programs under the 
well-founded semantics. Our objective is to extend them in such a way that 
domain-specific knowledge represented by a first order theory can be accessed 
from the rules. The theory will be called the external theory. Going beyond 
Datalog makes it possible to use data structures like lists for programming in 
the extended rule language. We want to define the semantics of the extended 
language so that the existing reasoners for normal logic programs and for the 
external theory can be re-used for querying the extended programs. Thus, 
our objective is to provide a framework for hybrid integration of normal logic 
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programs and external theories. Integration of Datalog rules with ontologies 
specified in a Description Logic could then be handled within the framework 
special case. 

The choice of the well-founded semantics as the basis for our approach 
is motivated by existence of top-down query answering algorithms, which 
facilitate building a query answering method for our framework. Notice also 
that the well-founded semantics and the answer set semantics are equivalent 
for a wide class of programs, including stratified normal programs. 

We introduce a notion of hybrid program; such a program is a pair (P, T) 
where T is a set of axioms in a first order language £ and P is a set of 
hybrid rules. T and P share function symbols but have disjoint alphabets of 
predicate symbols. This reflects the intuition that domain-specific knowledge 
is shared by many applications and is not redefined by the applications. A 
hybrid rule is a normal clause whose body may include a formula in C, called 
the constraint of the rule. We define declarative semantics of hybrid programs 
as a natural extension of the well-founded semantics of logic programs. It is 
3-valued; the semantics of a program (P, T) is a set of ground literals over 
the alphabet of P. The semantics is undecidable (as is the well-founded 
semantics for normal programs). However it is decidable for Datalog hybrid 
programs, provided the constraints are decidable. 

The operational semantics presented in this paper is goal driven, allows 
non ground goals, and its internal data are possibly non ground goals and 
constraints. It combines a variant of SLS-resolution (see e.g. |AB94j ) with 
handling constraints. The latter includes checking satisfiability of the con- 
straints w.r.t. T, which is assumed to be done by a reasoner of T. Thus the 
operational semantics provides a basis for development of implementations 
integrating LP (logic programming) reasoners supporting/ approximating the 
well-founded semantics (such as XSB Prolog [SS W + 07] ) with constraint 
solvers. The operational semantics is sound w.r.t. the declarative one, under 
rather weak sufficient conditions. It is complete for Datalog hybrid programs 
under a certain syntactic condition of safeness. 

In the special case of hybrid rules without non-monotonic negation the 
rules can be seen as the usual implications of the FOL, thus as additional 
axioms extending T. In this case every ground atom which is true in the 
semantics of the hybrid program is a (2- valued) logical consequence of PUT. 

The paper is organized as follows. Section [2] gives an (informal) introduc- 
tion to the well-founded semantics of normal logic programs, and presents 
the notion of constraint used in this paper. Basic ideas of Description Logics 
and their use for defining ontologies and ontological constraints are briefly 
discussed. Section [3] gives a formal presentation of the syntax and declar- 
ative semantics of the generic language of hybrid rules, parameterized by 



3 



the constraint domain. Section H] introduces the operational semantics; then 
soundness and completeness results relating the declarative semantics and 
the operational semantics are stated and proven. The last two sections con- 
tain discussion of related work and conclusions. A preliminary, abbreviated 
version of this work appeared as [DM07] . 

2 Preliminaries 

2.1 Normal logic programs and the well-founded se- 
mantics 

In this work we use the standard terminology and notation of logic program- 
ming (see e.g. |Apt97| ). 

The language of hybrid rules will be defined as an extension of normal 
logic programs. We assume that the programs are built over a first-order 
alphabet including a set Vr of predicates, a set V of variables and a set T of 
function symbols with different arities including a non-empty set of symbols 
of arity 0, called constants. 

Atomic formulae (or atoms) and terms are built in a usual way. A literal 
is an atomic formula (positive literal) or a negated atomic formula (negative 
literal). A literal (a term) not including variables is called ground. 

A normal logic program P is a finite set of rules of the form 

H <— Bi , . . . , B n where n > 

where H is an atomic formula, and B\, B n are literals. The rules are also 
called normal clauses. The rules with empty bodies (n = 0) are called facts or 
unary clauses; they are usually written without <—. A normal clause is called 
definite clause iff all literals of its body are positive. A definite program is a 
finite set of definite clauses. In this paper, a Datalog program is a normal 
logic program with T being a finite set of constants. 

The Herbrand base Hp is the set of all ground atoms built with the 
predicates, constants, and function symbols of P. For a subset S C Tip, by 
-iS we denote the set of negations of the elements of S, = { ->a \ a 6 S }. 
A ground instance of a rule R is a rule R' obtained by replacing each variable 
of R by a ground term over the alphabet. The set of all ground instances of 
the rules of a program P will be denoted ground(P) . Notice that in the case 
of Datalog ground(P) is a finite set of ground rules. 

A 3-valued Herbrand interpretation (shortly - interpretation) X of P is 
a subset of Tip U —Hp such that for no ground atom A both A and ->A are in 
I. Intuitively, the set I assigns the truth value t (true) to all its members. 
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Thus A is false (has the truth value f ) in X iff ->A G X, and ->A is false in X 
iS A el. If A X and ->A £ X then the truth value of A (and that of ->A) 
is u (undefined). This is in a natural way generalized to non ground atoms 
and non atomic formulae (see e.g. |AB94|). An interpretation X is a model 
of a formula F (which is denoted by X |=3 F) iff F is true in X. 

As usual, a 2-valued Herbrand interpretation is a subset of Hp. It assigns 
the value t to all its elements and the value f to all remaining elements of the 
Herbrand base. It is well known that any definite program P has a unique 
2-valued Herbrand model. We will denote it Aip. A normal program 
may not have the least Herbrand model. 

The well-founded semantics of logic programs [vGRS88j assigns to every 
program P a unique (three valued) Herbrand model, called the well-founded 
model of P. Intuitively, the facts of a program should be true, and the ground 
atoms which are not instances of the head of any rule should be false. This 
information can be used to reason which other atoms must be true and which 
must be false in any Herbrand model. Such a reasoning gives in the limit 
the well-founded model, where the truth values of some atoms may still be 
undefined. The well-founded semantics has several equivalent formulations. 
We briefly sketch here a definition following that of |FD93] . 

While defining the well-founded model, for every predicate symbol p we 
will treat —up as a new distinct predicate symbol. A normal program can 
thus be treated as a definite program over Herbrand base H U —0~L. A 3- 
valued interpretation over H can be treated as a 2-valued interpretation over 
HU-H. 

Let I be such an interpretation (/ C H U -iH). We define two ground, 
possibly infinite, definite programs P/tl and Pjt u I. For a given program P, 
P/fl is the ground instantiation of P together with ground unary clauses that 
show which negative literals are true in J. 

P/J = ground(P) U { ->A \ ->A G / } 

P/tuI is similar but all the negative literals that are true or undefined in / 
are made true here: 

P/ t J = ground(P) U { ->A \ A I, A e H} 

Now we define an operator ^p(I) which produces a new Herbrand inter- 
pretation of P: 

* P (i) = (M P/tI nn)u ->(H \ M P/tuI ) 

1 in the sense of set inclusion. 
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It can be proved that the operator is monotonic; ^p(I) C \l/p(J) whenever 
I C J. Its least fixed point is called the well-founded model WF(P) of 
program P. For some countable ordinal a we have WF(P) = ^p(0). 

The following example shows a simple Datalog program and its well- 
founded model. 

Example 2.1 A two person game consists in moving a token between ver- 
tices of a directed graph. Each move consists in traversing one edge from the 
actual position. Each of the players in order makes one move. The graph is 
described by a database of facts m(X, Y) corresponding to the edges of the 
graph. A position X is said to be a winning position X if there exists a move 
from X to a position Y which is a losing (non-winning) position: 

w(X) *- m(X,Y),^w(Y) 

Consider the graph , 

a — > e 

T I 

b <-> a — > c — > f 

and assume that it is encoded by the facts m(b,a),m(a,b), . . . ,m(e, f) of 
the program. The winning positions are e, c. The losing positions are d, f. 
Position a is not a losing one since the player has an option of moving to 
b from which the partner can only return to a. This intuition is properly 
reflected by the well-founded model of the program, it contains the following 
literals with the predicate symbol w: w(c),w(e), ->w(d), ->w(f). 

A non Datalog version of this example with an infinite graph is presented 
in |Dra93] . 

2.2 External theories 

In this section we discuss logical theories to be integrated with logic programs. 
2.2.1 Constraints 

Our objective is to define a general framework for extending normal logic 
programs, which, among others, can also be used for integration of Data- 
log rules with ontologies. Syntactically, the clauses of a logic program are 
extended by adding certain formulae of a certain logical theory. The added 
formulae will be called constraints. We use this term due to similarities 
with constraint logic programming [MSW06 . 

We will consider a 2-valued FOL theory, called external theory or con- 
straint theory. A set of its formulae is chosen as the set of constraints. Our 
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operational semantics imposes certain restrictions on the set of constraints. 
They are introduced together with the operational semantics. The declara- 
tive semantics works for an arbitrary set of constraints. The function symbols 
and the variables of the language of the external theory are the same as those 
of the language of rules. On the other hand, the predicate symbols of both 
languages are distinct. We will call them constraint predicates and rule pred- 
icates. We assume that the external theory is given by a set of axioms T 
and the standard consequence relation h of the FOL, or equivalently the log- 
ical consequence |= . (Other consequence operations can be used instead; for 
instance deriving those formulae which are true in a canonical model of T, 
or in a given class of models.) We will sometimes use T as the name of the 
theory. 

Sometimes one deals with an external theory whose set T c of function 
symbols is a proper subset of the set T of function symbols of the rules. For 
instance the external theory uses only constants, and the rules employ term 
constructors (i.e. non constant function symbols). In such case we simply 
extend the alphabet of the external theory so that its set of function symbols 
is T . The modified external theory is a conservative extension of the original 
one [Sho67j. A formula without symbols from T\T C is a logical consequence 
of T in one of them iff it is a logical consequence in the other. Thus such 
modification of the external theory is inessential; this justifies our assumption 
of a common alphabet of function symbols. 

2.2.2 Ontologies and ontological constraints 

This section surveys some basic concepts of Description Logics (DLs) [BCM + 03] 
and the use of DLs for specifying ontologies. An ontology may be defined 
as a "specification of a conceptualization" |Gru95] . An ontology should thus 
provide a formal definition of the terminology to be shared. 

Desciption Logics are specific fragments of the FOL. The syntax of a DL 
is built over disjoint alphabets of class names, property names and individ- 
ual names. From the point of view of FOL they are, respectively, one and 
two argument predicate symbols, and constants. Depending on the kind of 
DL, different constructors are provided to build class expressions (or briefly 
classes) and property expressions (or briefly properties). Some DLs allow 
also to represent concrete datatypes, such as strings or integers. In that 
case one distinguishes between individual-valued properties and data-valued 
properties. 

By an ontology we mean a finite set of axioms in some decidable DL. The 
axioms describe classes and properties of the ontology and assert facts about 
individuals and data. An ontology is thus a DL knowledge base consisting 
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of two parts: a TBox (terminology) including class axioms and property 
axioms and an ABox (assertions) stating facts about individuals and data. 
The axioms of DLs can be seen as an alternative representation of FOL 
formulae. Thus, the semantics of DLs is defined by referring to the usual 
notions of interpretation and model, and an ontology can be considered a 
FOL theory. 

For most of decidable DLs there exist well developed automatic reasoning 
techniques. Given an ontology T in a DL one can use a respective reasoner 
for checking if a formula C is a logical consequence of T. If T \^ C and 
T \f -<C then C is true in some models of the ontology and false in some 
other models. 

Ontologies are often specified in the standard Web ontology language 
OWL DL, based on the Description Logic SHOlN{D). OWL Ontologies 
can be seen as set of axioms in this DL. 

OWL DL class axioms make it possible to state class equivalence A = C 
and class inclusion ACC, where A is a class name and C is a class expression. 
Class expressions are built from class names using constructors, such as T 
(the universal concept), _L (the bottom concept), intersection, union and 
complement. Classes can also be described by direct enumeration of members 
and by restrictions on properties (for more details see [PSHH04J). 

Property axioms make it possible to state inclusion and equivalence of 
properties, specify the domain and the range of a property, state that a 
property is symmetric, transitive, functional, or inverse functional. 

OWL DL assertions indicate members of classes and properties. Indi- 
viduals are referred to by individual names. It is possible to declare that 
given individual names represent the same individual or that each of them 
represents a different individual. 

The following example using some expressive constructions of OWL DL 
will be used in the sequel to discuss how integration of Datalog with OWL 
DL ontologies is achieved in our framework. 

Example 2.2 In some research area an author of at least 3 books is con- 
sidered an expert. An OWL DL ontology referring to this research area has 
classes Author and Book, and a property isAuthorOf with domain Author 
and range Book. The class Expert can now be defined using OWL DL 
cardinality restriction^ 

Expert = isAuthorOf min 3 Book 

The property isAuthorOf has the inverse property has Author . The fol- 
lowing class expression defines the class of authors which co-authored a book 
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with a given author X (e.g. smith) 

(isAuthorOf some (hasAuthor value X)) 

All individuals of class Book which appear in the ontology are declared as 
distinct. The ontology states that the individuals johns and brown of class 
Author are the same. (This may happen e.g. due to a change of the name of 
a person). There are also authors smith and burns; smith, burns and johns 
are declared to be distinct. In addition, the ontology asserts that johns is 
the author of the books 61, 62 and brown is the author of the books 62, 63. 
Thus an OWL DL reasoner will conclude that johns (brown) is an expert. 

2.3 Datalog with Constraints: Introductory Examples 

We now illustrate the idea of adding constraints to rule bodies on two simple 
examples. The intention is to give an informal introduction to the seman- 
tics of hybrid rules. The first example will be used later on to accompany 
the formal presentation of the declarative and operational semantics of our 
framework. The second one illustrates some aspects of expressing external 
theories in OWL DL. 

Example 2.3 The example describes a variant of the game from Exam- 
ple 12.11 where the rules are subject to additional restrictions. Assume that 
the positions of the graph represent geographical locations described by an 
ontology. The ontology provides, among others, the following information 

• subclass relations (TBox axioms): e.g. Fi C E (locations in Finland 
are locations in Europe); 

• classification of some given locations represented by constants (ABox 
axioms). For instance, assuming that the positions of Example 12.11 
represent locations we may have Fi(b) (6 is a location in Finland), 
E(c) (c is a location in Europe). 

We now add some restrictions as ontological constraints^ added to the 

3 Symbol -i is used to denote two kinds of negation. Within a constraint it is the 
classical negation of the external theory. When applied to a rule predicate, -i denotes 
nonmonotonic negation. Thus two distinct negation symbols are not needed. 
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facts m(e, f) and m(c, /): 



w(X) «- m(X,Y),-w(Y) 

m(b,a) m(c, f) <- ^Fi(f) 

m(a,b) m(e, /) <- J5(/) 

m(a, c) 
m(c, d) 
m(d, e) 



ci — > e 

T I B « 

c — > / 



Intuitively, this would mean that the move from e to / is allowed only if / 
is in Europe and the move from c to / - only if / is not in Finland. These 
restrictions may influence the outcome of the game: / will still be a losing 
position but if the axioms of the ontology do not allow to conclude that / is 
in Europe, we cannot conclude that e is a winning position. However, we can 
conclude that if / is not in Europe then it cannot be in Finland. Thus, at least 
one of the conditions E(f), ->Fi(f) holds. If E(f) then, as in Example 12.11 
e is a winning position, d is a losing one, hence c is a winning position. On 
the other hand, if ->Fi(f) then the move from c to / is allowed, in which 
case c is a winning position. Therefore c is always a winning position; w(c) 
is considered to be a consequence of the program. 

Example 2.4 A committee of reviewers is to be created for evaluation of the 
applicants for an opened position. A reviewer has to be an expert, as defined 
by the ontology of Example 12.21 and must not have a conflict of interest (coi) 
with an applicant. Persons who are co-authors of a book have coi. (This 
implies that an author of a book has coi with himself /herself; this applies 
in particular to each expert). Additionally, some conflicts of interest are 
declared by facts. 

The following rules define a potential reviewer X for a candidate Y (re- 
lation mayreview). Two constraints are used: Expert(X) and (isAuthorOf 
some (hasAuthor value X))(Y). They refer to the ontology of Ex. 12.21 

mayreview (X, Y) <— Expert(X), -*coi(X, Y) 

coi(X,Y) <— (isAuthorOf some (hasAuthor value X))(Y) 

coi(johns, burns) 

The intention is to query the rules and the ontology for checking if a given 
person may be a reviewer for a given candidate. Consider the individual 
johns of Example 12.21 and check if she might be appointed a reviewer for 
some of the people named in the ontology. An OWL DL reasoner can check 
that johns is an expert and that she has the conflict of interest with herself, 
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i.e. with johns alias brown. The conflict of interest with burns is stated 
explicitly. So johns cannot be appointed a reviewer for herself and for burns. 

To check if johns has the conflict of interest with smith one has to refer to 
the ontology for checking if they co-authored a book. If this is confirmed by 
the reasoner (e.g. when the ontology asserts that both johns and smith are 
authors of 61) then coi(johns, smith) is true and johns cannot be a reviewer 
for smith. If non-existence of any co-authored book follows from the ontol- 
ogy, then coi{johns, smith) is false and johns can be a reviewer. Otherwise^ 
johns may be a reviewer for smith under the condition that they did not 
co-authored a book. This constraint should be returned in the answer to the 
query. 

An example employing non-nullary function symbols is given in |DHM07aJ. 
The semantics of hybrid programs presented below formalizes the intuitions 
presented in the examples of this section. 



3 Integration of rules and external theories 

This section defines the syntax and the (declarative) semantics of hybrid 
programs, integrating normal rules with first-order theories. The general 
principles discussed here apply in a special case to integration of Datalog 
with ontologies specified in Description Logics. 



3.1 Syntax 

We consider a first-order alphabet including, as usual, disjoint alphabets of 
predicate symbols V, function symbols T (including a set of constants) and 
variables V. We assume that V consists of two disjoint sets Vr {rule predi- 
cates) and Vc {constraint predicates) . The atoms and the literals constructed 
with these predicates will respectively be called rule atoms {rule literals) and 
constraint atoms {constraint literals). We will combine rules over alphabets 
Vr, T , V with an external theory T over Vc, J~ , V, employing constraints 
(a distinguished set of formulae of T). 

Definition 3.1 A hybrid rule (over Vr, Vc, J~ , V) is an expression of the 
form: 

H C,L\, . . . , L n 

where, n > each Lj is a rule literal and C is a constraint (over Vc, V); 
C is called the constraint of the rule. 

4 They are co-authors in some models of the ontology, and are not in some others. 
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A hybrid program is a pair (P, T) where P is a set of hybrid rules and 
T is a set of axioms over Vc, V. □ 



Hybrid rules are illustrated in Example 12.31 We adopt a convention that 
a constraint true, which is a logical constant interpreted as t, is omitted. 
Usually we do not distinguish between sequences, like Li, . . . , L n , and con- 
junctions, like L\ A . . . A L n . Notation L will be used to denote a sequence 
of rule literals (similarly t a sequence of terms, etc.); t=u will denote a con- 
junction of equalities t\=U\, . . . , tk—Uk- 

3.2 Declarative Semantics 

The declarative semantics of hybrid programs is defined as a generalization 
of the well-founded semantics of normal programs; it refers to the models of 
the external theory T of a hybrid program. Given a hybrid program (P, T) 
we cannot define a unique well-founded model of P since we have to take into 
consideration the logical values of the constraints in the rules. However, a 
unique well-founded model can be defined for any given model of T. Roughly 
speaking, the constraints in the rules are replaced by their logical values in 
the model (t or f); then the well-founded model of the obtained logic program 
is taken. The well-founded models are over the Herbrand universe, but the 
models of T are arbitrary. 

By applying a substitution 6 = {x±/ti, . . . , x n /t n } to a formula F we 
mean applying it to the free variables of F. Moreover, if a bound variable x 
of F occurs in some ti (1 < % < n) then x in F is replaced by a new variable. 

By a ground instance of a hybrid rule H <— C, L\, . . . , L n , where C is 
the constraint of the rule, we mean any rule H9 <— C6, L\6, . . . , L n 8, where 9 
is a substitution replacing the free variables of H <— C, L%, . . . , L n by ground 
terms (over the alphabet J 7 ). So the constraint CO has no free variables, and 
H9, LiO, . . . , L n 9 are ground literals. By ground(P) we denote the set of all 
ground instances of the hybrid rules in P. 

Definition 3.2 Let (P, T) be a hybrid program and let Mo be a model of 
T. Let P/Mq be the normal program obtained from ground(P) by 

• removing each rule constraint C which is true in Mo (i.e. Mo |= C), 

• removing each rule whose constraint C is not true in M , (i.e. M \£ C). 

The well-founded model WF(P/M ) of P/M is called the well-founded 
model of P based on M . 

A formula F (over Vr, J 7 , V) holds (is true) in the well-founded semantics 
of a hybrid program (P, T) (denoted (P, T) |= w f F) iff M |=3 F for each well- 
founded model M of (P, T). □ 
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Notice that the negation in the rule literals is non-monotonic, and the 
negation in the constraints is that from the external theory, thus monotonic. 

We say that F is false in the well-founded semantics of (P, T) if (P, T) |= w f 
->F, and that F is undefined if the logical value of F in each well-founded 
model of (P, T) is u. There is a fourth case: F has distinct logical values in 
various well-founded models of P. Formally, the semantics of (P, T) does not 
assign any truth value to such F. We may say that its truth value depends on 
the considered model of the external theory. Classes of models in which F has 
a specific truth value can by characterized by constraints. Such constraints 
provide sufficient conditions for F to have the specific truth value. They are 
constructed by the proposed operational semantics. 

Example 3.3 For the hybrid program (P, T) of Example 12.31 we have to 
consider models of the ontology T. For every model M of T such that M \= 
E(f) the program P/M includes the fact m(e, f). The well-founded model 
of P/Mq includes thus the literals ->w(f),w(e),-iw(d),w(c) (independently 
of whether M |= Fi(f)). 

On the other hand, for every model Mi of the ontology such that Mi |= 
-iFi(f) the program P/Mi includes the fact m(c, /). The well-founded model 
of P/M\ includes thus the literals -iw(f),w(c) (independently of whether 

M 1 \=E(f)). 

Notice that each of the models of the ontology falls in one of the above 
discussed cases. Thus, w(c) and ->w(f) hold in the well-founded semantics of 
the hybrid program, and the logical value of w(a) and that of w(b) is u in each 
well-founded model of the program. On the other hand w(e) and ->w(d) are 
true in those well-founded models WF(P/M ) of P for which the constraint 
E(f) is true in Mq. Similarly, ->w(e) and w(d) are true in those models for 
which E(f) is false. Thus the well-founded semantics assigns unique truth 
values to w(a),w(b),w(c) and w(f), but not to w(d) and w(e). The truth 
values of w(d) and w(e) can be characterized by additional constraints. 

Consider a case of hybrid rules without negative rule literals. So the non- 
monotonic negation does not occur. Such rules can be seen as implications 
of FOL and treated as axioms added to T. For such case the well-founded 
semantics is compatible with FOL in the following sense: For any ground rule 
atom A if (P, T) Kf A then P U T |= aE We omit a detailed proof. 

5 The reverse implication does not hold. As a counterexample take T = { 3x.q(x) } 
and P = {p <— q{x), r(x); r(x) <—}. PUT \= p but (P, T) ^ wf p, as there exist models of 
T in which each ground atom q(t) is false. 

We can obtain (something close to) the reverse implication by considering only those 
well-founded models which are based on Herbrand models of T. If PUT |= A then M \= A 
for each well-founded model M of P based on a Herbrand interpretation Mq of T. 



13 



As the well-founded semantics of normal programs is undecidable, so is 
the well-founded semantics of hybrid programs. It is however decidable for 
Datalog hybrid programs with decidable external theories (Section [5]). In 
Section H] we show that sound reasoning is possible (for arbitrary hybrid 
programs) by appropriate generalization of SLS-resolution. For the Datalog 
case the proposed reasoning scheme is complete under a certain safeness 
condition. 



3.3 Treatment of Equality 

In this section we discuss how equality is treated by the declarative semantics 
introduced above. The semantics is based on Herbrand models. Thus it treats 
distinct ground terms as having different values. 

Example 3.4 Consider a hybrid program (P, T), where P = {p(a) }. Both 
p(a) and ~<p(b) hold in the well-founded semantics of (P, T), even if T implies 
that a and b are equal. This feature of the semantics of hybrid programs may 
be found undesirable. 

We will call this phenomenon the problem of two equalities. Below we 
first show that the problem is well known in constraint logic programming 
(CLP) and explain how it is dealt with. Then we discuss two more formal 
ways of avoiding it: external theories where equality satisfies Clark equality 
theory (CET), and hybrid rules which are congruent w.r.t. a given external 
theory. 

The problem of two equalities is familiar from CLP [MSW06J, and is 
not found troublesome in practice. Most CLP implementations employ both 
syntactic equality and equality of the constraint domain^. Let us denote the 
latter by =' (and use = for the syntactic equality of the Herbrand domain). 
Formally, let us treat = as equality, and =' as an equivalence relation. As 
an example consider CLP over arithmetic constraints [MSW06] . Terms 2 + 2 
and 4 are distinct but denote the same number, we have 2 + 2^4 and 
2 + 2 =' 4. Constraint predicates treat 2 + 2 and 4 as equal. (Formally, 
=' is a congruence of the constraint predicates: p(ti, . . . ,t n ) iff p(ui, . . . , u n ) 
whenever t\ — Ui,...,t n — u n , for any constraint predicate p.) Other 
predicates may distinguish such terms. This is related to using unification in 
the operational semantics; unification is related to the syntactic equality. 

Apparently the programmers find this feature natural and not confusing. 
They are aware of dealing both with the Herbrand interpretation and with a 



6 See for instance the comment on an example constraint domain on p. 414 in [MSW06, 
Section 12.2]. 



14 



non Herbrand one. They know that the latter is employed only by constraint 
predicates. They take care of distinguishing the two corresponding equalities. 
For instance to express a fact that size should be true for the number 4, a 
rule size(N) <— N—4 will be used (instead of a fact size(4).). 

It what follows we refer to the free equality theory (CET, Clark equality 
theory) [Cla78j . CET consists of equality axioms 

x = x, 

x = y -»• fix) = f(y) for each / e J 7 , 

x = y — > (p(x) — > p(y)) for each predicate symbol p, including =, 
and freeness axioms 

f(x) = f(y) ->x = y for each f e J 7 , 
fix) 7^ g(y) for each pair of distinct f^gEJ 7 , 

x 7^ t for each non variable term t such that 

the variable x occurs in t. 

If the set T of function symbols is finite then CET additionally contains the 
weak domain closure axiom wdcaQ 

V 3y(x = f(y)). 

When T contains only constants then CET reduces to the unique name 
assumption (UNA). 

Assume that we have an external theory T with equality ='. We say that 
a set of hybrid rules P is congruent for a predicate symbol p w.r.t. T when 
T |= t\ — Ui, . . . , t n — u n implies 

(P, T) |= wf p(tx, ...,t n ) iff (P, T) ^ wf p(u!, ...,u n ), 

for any ground terms t\, . . . , t n , u%, . . . , u n . When P is congruent w.r.t. T for 
any rule predicate p then we say that P is congruent w.r.t. T (or shortly that 
(P, T) is congruent). 

Example 3.5 Program P = {p{a) } (from Ex. 13 .4j) is not congruent w.r.t. 
any T in which T \= a ^' b. 



7 This axiom is needed for CET to be complete, in the sense that any closed formula 
(with = as its only predicate symbol) has the same logical value in each model of CET. 
Consider for instance T = {a} and 3x(x ^ a). This formula is true in some models of 
CET without WDCA, but false in its (unique) Herbrand model. 
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The hybrid program from Ex. 12.41 with the fact coi(johns, burns) removed 
is congruent^, independently from T. 

Example 3.6 (Constructing congruent programs) Consider the pro- 
gram from Examples 12.31 13.31 The program implies w(c) and ~^w{g) (for- 
mally (P, T) |= w f w(c) and (P,T) |= w f ->w(g)). Assume that T implies that 
c =' g. For instance, the equality may be explicitly stated by an owl : sameAs 
assertion. Informally, equality c =' g is incompatible with P; the rules of P 
treat differently the objects c, g, while T states that they are equal. Formally, 
(P, T) is not congruent. 

One can modify P to make it treat c, g in the same way. It is sufficient to 
add rules m(a, g), m(g, d), and m(g, f) <— -iFi(f). (We replace c by g in the 
rules of P). Now iu(c) and w(g) hold in the well-founded semantics of the 
obtained program (P',T). The program is congruent, provided that T does 
not imply £j =' £2 for any other pair {ti,^} 7^ {c, g} of constants occurring 
in the program. 

We can modify P to make it congruent independently from T. The idea 
is to replace (implicit) = by explicit ='. For instance we may replace in P 
the rule w(X) m(X, Y), ->w(Y) by 

w(x) <- x =' x', y =' r, m(x', r), ^™(y). 

The obtained program (P", T) is congruent for w, w.r.t. any T. Alterna- 
tively, the rules for m can be modified in a similar way to make the program 
congruent for m. Then the program is also congruent for w (without modi- 
fying the rule for w). 

The program transformations above can be seen as usual CLP program- 
ming tricks. 

For congruent hybrid programs the problem of two equalities does not 
exist. (Also, it does not exist for external theories without equality.) The 
example above informally introduces programming techniques for construct- 
ing congruent programs. Now we present two simple criteria assuring that 
a program is congruent. (Congruency is undecidable, like other non trivial 
semantic properties of programs.) 

First, if the equality =' of T satisfies CET then each program (P,T) is 
congruent. (As then if t, u are ground terms then t — u implies that the terms 
are identical.) Apparently for this reason some approaches of combining rules 

8 The unchanged program is not congruent, unless the ontology implies that johns 
(brown) co-authored a book with burns. This is because coi(johns, burns) holds and 
coi(brown, burns) does not hold in the well-founded semantics of the program, but 
johns — brown. 
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and ontologies require that the ontology satisfies the unique name assumption 
(UNA). 

Another sufficient criterion is syntactic. Program (P, T) is congruent if 
in each rule H <— C, L±, . . . , L n of P all the arguments ti,...,t n of the head 
H = p(ti, . . . , t n ) are variables, and any variable occurs at most once in 
H,Li,... ,L n . (Thus the remaining occurrences of the variable are in the 
constraint C of the rule.) The proof that such (P, T) is congruent is based 
on the fact that for any model M of T if T \= ti—Ui, . . . ,t n — u n (for 
ground terms £1, . . . , t n , Ui, . . . , u n ) then a rule p(t\, . . . , t n ) <— B is in P/M 
iff p(m, . . . , u n ) B is in P/M. 

As an example, notice that the rule for w in P" (from Ex. 13.61) satisfies 
the sufficient condition, and the rules for m do not. Notice also that the 
condition is different from usually considered safeness conditions (the former 
- roughly speaking - forbids certain variable occurrences, while the latter 
require) . 

It is rather obvious how to construct programs satisfying this syntactic 
restriction, provided that the set of constraints includes equalities t — u of T. 
Instead of placing a non variable term t as an argument of the head of a rule, 
use a new variable Xt and add Xt — t to the constraint of the rule. Instead 
of writing more than one occurrences of a variable x in the rule literals of a 
rule, replace each (but one) occurrence of a; by a new distinct variable x' and 
add x' — x to the constraint of the rule. 

4 Reasoning with hybrid rules 

Now we present a way of computing the well-founded semantics of Definition 
13.21 Like in logic programming, the task is to find instances of a given goal 
formula G which are true in the well-founded semantics of a given program. 
Similarly to logic programming, our operational semantics is defined in terms 
of search trees. After introducing the operational semantics we prove its 
soundness and completeness, the latter for a restricted class of programs. 

4.1 Constraints for the operational semantics 

To construct the operational semantics we impose certain requirements on 
the external theory and the set of constraints. We need to deal explicitly 
with the syntactic equality = and its negation. So we require that = is a 
constraint predicate symbol and the external theory T includes the axioms 
CET (cf. Section 13.31) . An external theory T which does not satisfy this 
condition can be easily converted to a T which does. (T 7 may be a theory 
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without equality, or contain equality =' not satisfying CET.) Namely T is the 
union T = T U CET. Reasoning in such T can be implemented employing 
Prolog and a reasoner for T [DHM07b]. The former deals with =, the latter 
with the predicates of T . 

The operational semantics constructs new constraints using conjunction, 
disjunction, negation, and existential quantification. So we require that the 
set of constraints is closed under these operations. This imposes restrictions 
on the constraints. For instance many DLs do not allow negation of roles; for 
such DL a formula of the form r(X, Y) cannot be a constraint. The actual 
choice of constraints is outside of the scope of this paper. It depends on 
the chosen external theory and the available reasoner for it. For instance, if 
a formula C is a constraint without = then the reasoner should be able to 
check whether C is satisfiable in T 7 (where T and T' are as above). 

4.2 Operational Semantics 

The operational semantics presented below is a generalization of SLS-resol- 
ution [Prz89j, which is extended by handling constraints originating from the 
hybrid rules. It is based on the constructive negation approach presented in 
|Dra93llDra95| . In logic programming, the term constructive negation stands 
for generalizations of negation as failure (NAF) (see e.g. AIM) 11 ) . NAF pro- 
vides a way of checking whether a given negative goal is a consequence of 
the program (under a relevant semantics). Constructive negation, roughly 
speaking, finds instances of a negative goal which are consequences. The 
main contribution of the operational semantics presented here is dealing with 
hybrid programs and arbitrary external theories. The constructive negation 
method of |Dra93l IDra95j dealt with logic programs, the equality was the 
only constraint predicate and CET was the constraint theory. 

The operational semantics is similar to SLDNF- and SLS-resolution |Llo87l 
IPrz89] . For an input goal a derivation tree is constructed; its nodes are goals. 
Whenever a negative literal is selected in some node, a subsidiary derivation 
tree is constructed. So a tree of trees is obtained. 

Definition 4.1 By the restriction F\ v of a formula F to a set V of variables 
we mean the formula 3x±, . . . , x n F where x±, . . . , x n are those free variables 
of F that are not in V. By F\ F , we mean F\ v , where V are the free variables 
of formula F'. 

By a goal we mean a conjunction of the form C, L\, . . . , L n (n > 0), 
where each Lj is a rule literal and C is a constraint (the constraint of the 
goal). Consider a goal G = C, L,p(t), L' and a rule R = p(u) <— C, K, such 
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that no variable occurs both in G and R. We say that the goal 

G' = t=u,C,C',L,K,L i 

is derived from G by R, with the selected atom p(t), if the constraint 
t—u, C, C is satisfiable. 

We inductively define two kinds of derivation trees: t-trees and tu-trees. 
Their role is to find out when a given goal is t, or respectively when it is 
t or u. Informally, if a constraint C is a leaf of a t-tree with the root G 
then C implies that G is t in the well-founded semantics of the program. 
(More generally, the same holds if C is a disjunction of such leaves.) On the 
other hand, for a tu-tree we define a notion of its cross-section. If C\, . . . , C n 
are the constraints of the goals of a cross-section of a tu-tree with the root 
G then, roughly speaking, ->{C\ V ... V C n ) implies that G is f in the well- 
founded semantics of the program. A formal explanation is provided by the 
soundness theorem (14.91) in the next section. 

For correctness of the definition (to avoid circularity) we assign ranks 
to the trees. This is a standard technique employed in similar definitions 
|Llo87| IPrz89| IDra95j . In the general case ranks are countable ordinals, but 



natural numbers are sufficient for a language where the function symbols are 
constants. The children of nodes with an atom selected are defined as in 
the standard SLD-resolution. The only difference is that instead of explicit 
unification we employ equality constraints. The children of nodes with a 
negative literal selected are constructed employing the results of tu- (t-) trees 
of lower rank. A t-tree refers to tu-trees and vice versa. This is basically a 
reformulation of the corresponding definitions of [Dra93, Dra95j. 

Definition 4.2 (Operational semantics) A t-tree (tu-tree) of rank k > 
for a goal G w.r.t. a program (P, T) satisfies the following conditions. The 
nodes of the tree are (labelled by) goals. In each node a rule literal is selected, 
if such a literal exists. A node containing no rule literal is called successful, 
a branch of the tree with a successful leaf is also called successful. 



1. A constraint (C\ V • • • V C n )\ G (n > Ojj is an answer of the t-tree if 
C\, . . . ,C n are (some of the) successful leaves of the t-tree. (It is not 
required that all the successful leaves are taken.) 

2. By a cross-section (or frontier) of a tu-tree we mean a set F of tree 
nodes such that each successful branch of the tree has a node in F. Let 



If n = then by C\ V • • • V C n we mean false, and by C\ A • • • A C n we mean true. 
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F be a cross-section of the tu-tree and CF = { Ci, . . . } the constraints 
of the nodes in F. 

If CF = { Ci, . . . , C n } is finite then the constraint -i(Ci| G ), . . . , "'(Cnlc) 
(the negation of V(^Ig)) * s called a negative answer of the tu-tree. 

If CF is infinite then a constraint C which implies ~^{Ci\ G ) for each 
Ci G CF is called a negative answer of the tu-tree. Moreover it is 
required that each free variable of C is a free variable of G. 

3. If (in the t-tree or tu-tree) the selected literal A in a node C is an atom 
then, for each rule R of P, a goal derived from C with A selected by a 
variant R! of i? is a child of C, provided such a goal exists. Moreover 
it is required that no variable in R' occurs in the tree on the path from 
the root to C. 

4. Consider a node C = C, L, ->A, L' of the t-tree (tu-tree), in which the 
selected literal ->A is negative. The node is a leaf or has one child, 
under the following conditions. 

(a) If the tree is a t-tree then 

i. C is a leaf, or 

ii. G' has a child C, C, L, L', where C is a negative answer of a 
tu-tree for C, A of rank < k, and C, C is satisfiable. 

(b) If the tree is a tu-tree then 

i. C has a child C, L, L', or 

ii. C has a child C, C, L, L', where C = ->C" is the negation of 
an answer C" of a t-tree for C, A of rank < k, and C, C is 
satisfiable, or 

iii. C is a leaf and there exists an answer C" of a t-tree for C, A 
of rank < k such that -iC", C is unsatisfiable. 

An informal explanation for case [2] is that the constraints of the cross- 
section include all the cases in which G is t or u, thus their negation implies 
that G is f . A useful intuition is that adding a negative answer C to the 
nodes of the tu-tree results in a failed tree - a tree for C, G without any 
successful leaf. (For the constraint Ci of any node of the cross-section, the 
constraint C, Cj is unsatisfiable. The same holds for any node which is a 
descendant of some node of the cross-section.) 



An informal explanation for case H] is that in a t-tree (case 4(a)ii) C' 



implies that C, A is f, equivalently ->(C, A) is t. Hence C',C implies that 



A is t. In a tu-tree (4(b)ii) -iC" includes all the cases in which C, A is not 
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t. Hence -<C", C - the constraint of the child - includes all the cases in which 
A is not t, equivalently in which ->A is t or u. 

Notice that in case 4(a)i the node G' = C, L, ->A, L' may unconditionally 
be a leaf of a t-tree (of any rank). This corresponds to the fact that C = 
— iC is a negative answer for any tu-tree for C, A. (Take the cross-section 
{ C, A}). Hence in the supposed child of G' (case 4(a)ii) the constraint -iC, C 

C,L, ~ 



is unsatisfiable. Conversely, according to 4(b)i, node G' 



iA, L' 



m a 



tu-tree may have C, L, L' as the child. This corresponds to the fact that 
C" = false is an answer of any t-tree. Hence C is equivalent to -<C", C 



4(b)ii 



(which is the constraint obtained in 4(b)ii). Thus 4(b)i is a special case of 



Example 4.3 Consider a query w(c) for the hybrid program of Example 12.31 
It can be answered by the operational semantics by construction of the fol- 
lowing trees. (Sometimes we replace a constraint by an equivalent one.) 



1. A t-tree for w(c): 



w(c) 



X=c,m{X,Y),^w{Y) 

I \ 
X=c, Y=f, -.Fz(/), -™(Y) X=c, Y=d, ~^w{Y) 

X=c, Y=f, -nFi(f) X=c, Y=d, -.(X=c, Y=d, -.£(/)) 

The tree refers to negative answers derived in the cases El H] below. 
The constraint in the second leaf is equivalent to X—c, Y—d, E(f) (as 
a A -i(a A 0) is equivalent to a A -1/3). The answer obtained from 
the two leaves Ci,C 2 of the tree is 3X, Y [C\ V C 2 ). It is equivalent 
to -iFi(f) V E(f). As this constraint is a logical consequence of the 
ontology, w(c) holds in each well-founded model of the program. 

2. A tu-tree for X—c, Y=d, w(Y), employing an answer from the t-tree 
from case [3) 

X = C ,Y = d,w(Y) 
X = C ,Y = d,X' = Y, m(X', Y'), ^w(Y') 
X = c,Y = d,X' = d,Y' = e, -w(Y') 
X = C ,Y = d,X' = d,Y' = e, -.(X = c, Y = d, X' = d, Y' = e, E(f)) 
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The leaf is equivalent to X—c, Y—d, X'—d, Y'—e, ->E(f), see the expla- 
nation in the previous case. Hence from the cross-section containing 
the leaf we obtain a negative answer equivalent to ->(X=c, Y=d, ->E(f)) 
and to D = -.(X=c, Y=d) V E(f). 

Informally, D implies falsity of the root of the tu-tree. (For a formal 
treatment see Theorem 14 . 9 1 and Lemma fl~8l below) . Hence E(f) implies 
-<w(d). Formally, if Mo |= E(f) for some model Mq of T then then 
w(d) is false in WF(P/M ) (the well-founded model of P based on 
M ). 

3. A t-tree for Y' = e,w(Y') employing a negative answer from caseHJ 

Y' = e,w(Y') 

Y' = e, X" = Y', m(X", Y"), -nw{Y") 
Y> = e, X" = Y', X" = e, Y" = f, E(f), ->w{Y") 

Y' = e,X" = e,Y" = f,E(f) 

The corresponding answer is (equivalent to) Y'=e, E(f). Informally, 
the answer implies Y' = e,w(Y'). From Lemma 14.81 below it follows 
that if E(f) holds in some model Md of T then w(e) is true in the 
corresponding well founded model of P. 

Notice that if C, Y'=e is a satisfiable constraint then C may be added 
to the nodes of the tree (maybe with renaming of variables X",Y"). 
Hence C, Y'—e, E(f) is an answer for C, Y'=e, w(Y'). To construct the 
t-tree of case [2] we use C = (X=c,Y=d, X'—d). 

4. A tu-tree for Y—f,w(Y), with atom m(X',Y') selected in the leaf: 

Y=f,w(Y) 

Y=f, Y=X', m(X', Y'), ^w(Y') 

From the empty cross-section a negative answer true is obtained. So 
w(f) is false in the well-founded semantics of the program. Similarly, 
true is a negative answer for C, Y=f, w(Y), where C is an arbitrary 
constraint. 

Various simplifications of t- (tu-) trees are possible. For instance in 
case [3] of the last example the nodes of the tree may be replaced by u>(e); 
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m(e,Y),-iw(Y); E(f),->w(f); E(f). This issue is outside of the scope of 
this paper. 

We do not deal here with actual implementing of the operational seman- 
tics. (An implementation is described in |DHM07b] .) We only mention that 
- similarly as in CLP - it is not necessary to check satisfiability of the con- 
straint for each node. The answers (negative answers) of trees obtained in 
this way are logically equivalent to those of t-trees (tu-trees) from Def. 14.21 

4.3 Soundness 

In this section we prove soundness of the operational semantics of hybrid 
programs (Def. l4.2| ) with respect to their declarative semantics (Def. 13.21) . 
Before the actual proof we discuss ground instances of goals and trees, and 
introduce safe programs and goals. These notions are employed in the proof. 

For our proofs we use the characterization of the well-founded semantics 
of logic programs from Section 12.11 So for a given model M of the external 
theory, the well-founded model of the program is ^ < pi Mo {$) for some a. 

4.3.1 Ground instances of trees 

By an extension of a substitution 9 we mean any substitution of the form 
9 U 9' (where 9 = {xi/ti, . . . ,x n /t n } and 9' = {y\/u\, . . . ,y n /u n } are substi- 
tutions with disjoint domains, {x±, . . . , x n } fl {yi, . . . , y m } = 0). 

By a grounding substitution for the variables of a formula F (or just 
"for F" ) we mean a substitution replacing the free variables of F by ground 
terms. (The domain of the substitution may include other variables.) 

Let G = C, L be a goal and Mq a model of T. Let 9 be a grounding 
substitution for the variables of G. (Notice that C9 has no free variables.) 
If C9 is true in M then we say that 9 is applicable to G (w.r.t. Mq), and 
by the result G9 of applying 9 to G we mean the ground goal L9; it is called 
a ground instance of G. Similarly, we say that 9 is applicable to a rule 
H <— C,L; the result H9 <— L9 is called a normal ground instance of the 
rule0 Notice that a rule R9 is a normal ground instance of a rule R £ P 
w.r.t. M iff RB G (P/M ). 

Consider a t-tree or tu-tree Tr for G and 9 as above. A ground instance 
Tr9 of Tr w.r.t. M is defined recursively as follows. The nodes of Tr9 are 
ground instances of (some) nodes of Tr, each node H of Tr9 corresponds 

10 Definition 13.21 employs another kind of ground instance, namely H6 <— C9,L8. As 
ground instances in that sense are not used below, we sometimes skip the word "normal." 
To simplify notation, we write R9 for a normal ground instance of a rule R, when this 
does not lead to ambiguity. 
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to a node G' of Tr such that if is a ground instance of G'. The root of Tr9 is 
GO and it corresponds to the root G of Tr. If a node G'9' of Tr# corresponds 
to node G' of Tr (where 9' is a grounding substitution for the variables of 
G'), G" is a child of G 1 in Tr, 0" is an extension of 9' then G"9" is a child of 
G'9' in Tr#, provided that 9" is applicable to G" . 

A node of Tr# corresponding to a successful leaf of Tr will be called a 
successful leaf of Tr0. 

Example 4.4 Consider a program P = {p(a) <— The t-tree (or tu- 

tree) Tr for p(x) consists of two nodes; the child of p(x) is x = a,q(y). For 
9 = {x/b} the ground instance Tr9 consists of one node p(b). For a = {x/a} 
the root p(a) of Tra has a child q(t) for each ground term t. Each node g(t) 
in Tra corresponds to the node x — a, q(y) of Tr. 

Notice that if G'9' and its child G"9" in Tr9 correspond, respectively, to 
G' and G" in Tr, and G" is derived from G" by a rule R £ P then G"'^' is 
derived from G'#' by a normal ground instance Ra G (P/Mq) of i?. (This 
means that i?cx is H^B, H is an atom in G'9' and £?" is G'9' with if 
replaced by B.) Thus if a leaf G'9' of Tr# corresponds to a node G' of Tr, 
and a positive literal A is selected in G' then no normal ground instance of 
a rule in the program has the head A9. If no negative literal is selected in 
Tr then Tr9 is an SLD-tree ( |Llo87j ) for program (P/Mq). 

4.3.2 Safeness 

We introduce a notion similar to DL-safeness [MSS051 IRos06j . but taking 
into account that constraints may contain equality =. 

Definition 4.5 Let C be a constraint. A variable x is bound in C to a 

ground term t (to a variable y)ifT\=C— > x — t (respectively T |= C — > 
x = y). 

For instance, in x — f(y), y = a variable x is bound to ground term f(a), 
and y is bound to a. Notice that any variable is bound to itself indepen- 
dently of the constraint. A simple sufficient condition is that, for variables 
Xo,...,x n (n > 0) and a ground term t, if C is a conjunction of con- 
straints Cx,...,Ci and the set {Ci, . . . , Ci} contains equalities x =Xi, . . . , 
£ n _i=£„ (resp. x = Xi, . . . ,x n _i=x n ,x n =t) then x is bound to x n (resp. 
to t) in C. 

Definition 4.6 A rule R = H <— C, L, where C is the constraint of R, is 
safe if 



24 



- each variable of H, 

- each variable of a negative literal of L, and 

- each free variable of C 

is bound in C to a ground term or to a variable appearing in a positive literal 
in L. 

Let V be a set of variables. When the conditions above are satisfied with 
possible exception for the variables from V then we say that R is safe apart 
of V. 

A set of rules is safe if all its rules are safe. A hybrid program (P, T) is 
is safe if P is safe. A goal G = C, L is safe if the rule p <— G is safe (where 
p is a O-argument predicate symbol). G is safe apart of V if the rule p <— G 
is safe apart of V. 

If the root of a t-tree (tu-tree) for a safe program is safe then any node 
of the tree is safe. Hence, in the constraint of a successful leaf, all the free 
variables are bound to ground terms. In the Appendix we prove a more 
general property: 

Lemma 4.7 Let (P, T) be a safe program, V a set of variables, and G a 
goal. Consider a t-tree (or a tu-tree) with the root G. 

1. Each node of the tree is safe apart of Vo, where Vq is the set of free 
variables of G. 

2. Assume that no variable from V occurs in any variant of a rule from 
P used in constructing the tree. If G is safe apart of V then each node of 
the tree is safe apart of V. 

4.3.3 Soundness theorem 

We will say that the set of constraints has the witness property if for 
any model M of the external theory T and for any constraint C, whenever 
M \= 3C then M \= C6 for some grounding substitution 9 for C. The witness 
property is implied by the parameter names assumption (PNA) [dBPPV07| 
that restricts the interpretations of T to those in which every domain element 
is a value of a ground term. 

The operational semantics may be not sound for constraints without wit- 
ness property and non safe programs. As an example take P = {p <— q(x) } 
and T = {3x.q(x)}. Then (a constraint 3x.q(x) equivalent to) true is an 
answer of the t-tree for p. However (P, T) |^ wf p (as there exist models of T 
in which every ground instance of q(x) is false). 
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Lemma 4.8 (Soundness) Consider a program (P, T), a goal G, a model 
M of T, and a countable ordinal number k. Assume that M is a Herbrand 
interpretation, or the set of constraints has the witness property, or P is safe. 

1. If G is an answer of a t-tree of rank k for G then for any grounding 
substitution (for the variables of G) M |= CO implies ^p)| /o (0) h3 GO. 

2. If C is a negative answer of a tu-tree of rank k for G then for 
any grounding substitution (for the variables of G) M Q \= CO implies 

The proof is presented in the AppendixJ^j] It is based on studying ground 
instances of the t- (tu-) tree for G and viewing them as SLD-trees of a 
program (P/M )/ t I (respectively a program related to (P/M )/ tu I) from the 
definition of ^p/m - 

As a corollary we obtain: 

Theorem 4.9 (Soundness) Let (P, T) be a hybrid program and G = Co, L 
a goal (where Cq is the constraint of G). Assume that P is safe, or the set 
of constraints has the witness property. 

If C is an answer of a t-tree for (P, T) and G then, for any substitution 0, 
T^CO implies (P, T) |= wf L0. 

If C is a negative answer of a tu-tree for (P, T) and G then, for any 
substitution 0, T \= CO implies (P, T) |= w f 

PROOF. Let G = Go, L and G be an answer of the t-tree. T |= CO implies 
M Q \= COO' for any substitution 0' and any model M of T. Consider a 0' 
such that is ground for each free variable x of (G, From Lemma l4~8l 
applied to M , G, G, and the substitution 00', it follows that M |=3 GOO' , 
where M is the well-founded model or P based on M . Notice that GOO' = 
LOO'. As M |=3 LOO' for each 6 1 ' as above and M is a Herbrand interpretation, 
we have M |= 3 As the latter holds for each well-founded model of P, we 
obtain (P, T) |= w f L#. 

The proof for a negative answer of a tu-tree is analogical. □ 

It may be desirable to have an operational semantics which is sound also 
for non safe programs and constraints without the witness property. This 
can be obtained by employing, in formula restrictions (Def. 14.11) . a certain 
non standard quantifier 3' instead of 3. For the new quantifier it holds that 

11 From the proof it follows that safeness is needed only for the rules that have been 
used in constructing t-trees (the t-trees referred to, directly or indirectly, by the t- (tu-) 
tree for G). Alternatively, the witness property is necessary only for the constraints that 
are successful leaves of these t-trees. 
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if I \= 3'F then / |= F9 for some grounding substitution 9 for F (for any 
formula F and interpretation 7). The details are outside the scope of this 
paper. 

4.4 Completeness 

In a general case our operational semantics is not complete. Roughly speak- 
ing, the reason is using only finite constraint formulae as (negative) answers^! 
in case H] of Def . 14.21 We show completeness of our operational semantics for 
the case where the Herbrand universe is finite and the program and goals are 
safe. The completeness result includes independence from the selection rule. 

We first present a technical lemma about simplifying goals for which 
t-trees (tu-trees) are constructed. Then we restrict our considerations to 
safe programs over a finite universe, show how a kind of a most general 
(negative) answer for a given tree can be obtained, and define a notion of 
a maximal t- (tu-) tree. Intuitively, a maximal tree (of a sufficiently high 
rank) derives everything that is required by the declarative semantics. This 
is made formal in a completeness lemma, from which completeness of the 
operational semantics follows. 

The following lemma shows how a tree for A may replace a tree for C, A. 
So, for a fixed A, many trees for goals C, A may be replaced by a single tree. 
(On the other hand, for a fixed C the tree for A may have more nodes than 
the corresponding one for C, A.) 

Lemma 4.10 Consider a program (P,T). Let A be an atom and C a 
constraint. If ~>(Ci\ A ), . . . , ~^(C n \ A ) is a negative answer of a tu-tree (re- 
spectively a negation of an answer of a t-tree) of rank k for A then 
-.(C, (CxIJ), • • • , -(C, (C n U), or equivalents -.C V -.(Ci|J, . . . , -(C„| A ), 
is a negative answer of a tu-tree (the negation of an answer of a t-tree) of 
rank k for C, A. 

PROOF. Without loss of generality we can assume that if a variable x occurs 
both in the tree for A and in C then x occurs in A. If C, L is a node of the 
tree for A then C, C, L is a node of the tree for C, A provided that C, C is 
satisfiable. Assume the negative answer for A is obtained from a finite cross- 
section. Consider a "corresponding" cross-section of the tree for C, A whose 
node constraints are those of C, that are satisfiable. The corresponding 

12 In [Dra93, Dra95] this problem was solved by allowing an infinite set of children 
of a node with a negative literal selected. Example 4.10 in [Dra93j shows that this is 
actually necessary. Thus it provides a counterexample for completeness of the operational 
semantics of Def. 14.21 
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negative answer is _i ((C, Ci)\ A ), . . . , _i ((C, C n )| A ). Each (C, Ci)\ c A is equiv- 
alent to C, (Ci\ A ). The cases of an infinite cross-section, and of a t-tree, are 
similar. □ 



4.4.1 Maximal trees 

For this section we assume that the Herbrand universe is finite. Hence the 
alphabet of function symbols is finite and contains only constants. 

A t- (tu-) tree may be infinite. Moreover, the set of nodes in the tree 
with a negative literal selected may be infinite. Hence the tree may refer to 
an infinite set of subsidiary trees. The answers (negative answers) of the tree 
may be obtained from an infinite set of successful leaves (an infinite cross- 
section); so it seems that we have to deal with an infinite set of answers (as 
there may not exists one which implies all the others). In what follows we 
show how to avoid infinite sets of answers. 

Notice first that the set of selected negative literals in a t- (tu-) tree is 
finite, up to renaming of variables (as the Herbrand universe is finite). By 
Lemma 14.101 instead of constructing a possibly infinite set of subsidiary tu- 
(t-) trees for goals of the form C, A, it is sufficient to construct a finite set of 
tu- (t-) trees for goals of the form A. 

In a t- (tu-) tree with a safe root, if C is a successful leaf then each 
free variable x of C is bound to a constant c x . This defines a grounding 
substitution 9 = {xi/c Xl , . . . , x m /c Xm } for the free variables xi,...,x m of 
C. Now C is equivalent to x\ — c xi , . . . , x m — c Xm , C and to X\ — c xi , . . . , 

■Em Qcm' CO. 

By a grounded constraint from a program P (from a goal G) we mean 
a constraint Co such that constraint C is the constraint of a rule of P (the 
constraint of G) and the substitution a replaces all the free variables of C by 
constants. A constraint is in a solved form for a program P and an initial 
goal G if it is a conjunction of constraints of the form x = c, C or -iC , 
where x is a variable, c is a constant, and C is a grounded constraint from 
P or G. 

The set of grounded constraints from a given program or a given goal is 
finite. So the set of constrains in solved form (for P and G) with the free 
variables from G is finite, up to equivalence. 

We now show that, under certain conditions, the successful leaves of t- 
(tu-) trees may be seen as disjunctions of constraints in solved form. 

Lemma 4.11 Let the Herbrand universe be finite and Tr be a t-tree (tu- 
tree) for a safe goal G and a safe program (P,T). Assume that each neg- 
ative answer (negation of an answer) employed in the tree is of the form 
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-iCi, . . . , -iC„, where each Cj is in solved form for P and G. Then the con- 
straint of any success leaf C of the tree is equivalent to a disjunction of 
constraints in solved form. Also, C\ G is equivalent to a disjunction of con- 
straints in solved form for P and G. 

PROOF. Consider a success leaf C. Each free variable x of C is bound in 
C to a constant c x . Let = {xi/c xi , . . . , x m /c Xm } (where xi, . . . , x m are the 
free variables of C). C is equivalent to x\ = c xi ,...,x m = c Xm ,C and to 
%i — c Xl , . . . , x m = c Xm , C9 {CO has no free variables). 

CO = Ci, . . . ,C n , where each Cj is a ground equality, or a grounded 
constraint from P or from G, or a ->C^ where C[ is a constraint in a solved 
form. -iCj- is equivalent to —*D\ V ... V -i-D;, where each Di is of the form 
x = c or is a possibly negated grounded constraint from P or from G. Thus 
-iC-6 1 is equivalent to true (if some ground disequality ->DiO is true), or to 
-i-Djj V ... V ->Dj , , where each Dj i is a possibly negated grounded constraint 
from P or G. Each Cj which is a ground equality is equivalent to true (as it 
is satisfiable). 

So applying distributivity (0, (-0 V if/) = 0, ^ V 0, -0') to ii = c Xl , . . . , 
— c x m ,C0 results in an equivalent disjunction of constraints in solved 
form. Removing from the latter constraint each Xi = Ci where Xi not free in 
G produces a disjunction of constraints in solved form equivalent to C\ G . □ 

As discussed above, the set of disjunctions of constraints in solved form 
(for P and G) with the free variables from G is finite, up to equivalence. 
Thus from the lemma it follows that the set of success leaves of the t-tree 
(tu-tree) is equivalent to a finite set of disjunctions of constraints in solved 
form. Formally: If the tree satisfies the conditions of Lemma T4. Ill then there 
exists a constraint D (we will call it a finite answer of the t-tree, resp. 
finite pseudo-answer of the tu-tree), such that 

M |= C\ G for some success leaf C of the tree iff M \= DO, 

for every model M of T and every substitution grounding the variables 
of G. Moreover, D is a disjunction of constraints in solved form, for P and 
the root G of the tree. {D is equivalent to a constraint (Ci| G ) V ... V {C m \ G ) 
where each Cj is a success leaf of the tree.) 

Notice that the negation of a pseudo-answer of a tu-tree is a negative 
answer of the tree. (The corresponding cross-section contains all the success- 
ful leaves.) Informally, a finite answer is a most general answer that can be 
obtained from a given t-tree, and the negation of a finite pseudo-answer is a 
most general negative answer that can be obtained from a given tu-tree. 
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Now for a given rank and goal we define a maximal t- (tu-) tree. The 
intention is that the tree provides a (negative) answer which is more general 
than other (negative) answers for this goal of the same rank. 

Definition 4.12 Let the Herbrand universe be finite, and (P,T) be a safe 
program. A maximal t-tree (tu-tree) for a goal and (P, T) is defined induc- 
tively: 

A maximal t-tree of rank is a t-tree in which each node with a negative 
literal selected is a leaf. 

A maximal tu-tree of rank is a tu-tree in which if a negative literal is 
selected in a node G' and the constraint of G' is C then the constraint of the 



child of G' is C (cf. Def. Urease 4(b)i) 



A maximal t-tree (tu-tree) of rank k > is a t-tree (tu-tree) in which 
a node G' with a negative literal ->A selected has a child G" iff C, ->D is 
satisfiable, where C is the constraint of G' and D is a finite pseudo- answer 
(a finite answer) of a maximal tu-tree (t-tree) for A of rank k — 1; moreover 
C, ->D is the constraint of G"@ 

The definition is correct: From Lemma 14.101 and 14.111 by induction on 
the rank we obtain that, for a safe program P and a safe goal G, the tree 
described in the definition satisfies the conditions of Lemma 14.111 it has a 
finite (pseudo-) answer which is a disjunction of constraints in solved form 1^*1 
hence the finite (pseudo-) answers employed in the definition exist. Notice 
that a maximal t- (tu-) tree is defined for any safe program and any goal Go. 
If Gq is safe then the tree has a finite (pseudo) answer. 



4.4.2 Completeness theorem 

By a selection rule we mean a function which, given a sequence of goals 
Go, • • • , G n , selects a rule literal in G n provided G n contains at least one rule 
literal and Go, . . . , G n is a prefix of a branch of a t-, tu-, or SLD-tree. 

Now we are ready to state and prove completeness of the operational 
semantics. We first show that a (negative) answer of any maximal (tu-) t- 
tree for G of rank j - speaking informally - describes all the instances of 
G that are true (false) in a corresponding approximation \I/|y Mo (0) of the 
well-founded model of P/M . 

13 By Lemma r4.1Q[ V -iD is a negative answer (the negation of an answer) for C, A. 
Hence according to Def. 14.21 the constraint of the child of G is C, (->C V -*D), which is 
equivalent to C, -*D. 

14 p or p anc [ q if q cons i s ts of a single atom A then the constraints are in solved form 
for P and any goal. 
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Lemma 4.13 (Completeness) Assume that the Herbrand universe is fi- 
nite. Consider a safe program (P,T). Let G be a safe goal, C be the 
constraint of G, and k > be a natural number. Consider a selection rule 
TZ, and a maximal t-tree and a maximal tu-tree for G of rank A; — 1 if G does 
not contain a negative literal, and of rank k otherwise. 

Let M be a model of T, and 6* be a grounding substitution for the 
variables of G such that M |= Co^ 

1. If x I / p/ Mo (0) |=3 C6 1 and C is a finite answer of the t-tree then M |= C6. 

2. If ^p/ Mo (0) |=3 ""C^ and D is a finite pseudo-answer of the tu-tree 
(and thus ->D is a negative answer) then Mq |= ->D0. 

The basic idea of the proof is that in case 1. it follows that there exists 
a successful SLD-derivation for GO and a certain program (P / Mq) / t J . This 
derivation is shown to be a branch of a ground instance of the maximal t-tree 
for G. In case 2. there does not exist a successful SLD-derivation for GO and 
a certain program (P/M )/ tu J. However such a derivation is shown to be a 
branch of a ground instance of the maximal tu-tree for G, under assumption 
that Mo |= DO. Hence Mq \= -<D6. The detailed proof is given in the 
Appendix. 

As a main result we obtain completeness and independence from the 
selection rule (of the operational semantics of Def. 14.21 w.r.t. the declarative 
semantics of Def. 13 . 21) . 

Theorem 4.14 (Completeness) Assume that the Herbrand universe is fi- 
nite. Consider a safe program (P, T), a safe goal G = Co, L (where Co is the 
constraint of G), and a selection rule TZ. Let be a grounding substitution 
for the variables of G such that CqO is satisfiable. 

1. If (P, T) |= w f LO then there exists a t-tree (of a finite rank) for G via 
TZ with an answer C such that T \= CO. 

2. If (P, T) |= w f -iLO then there exists a tu-tree (of a finite rank) for G 
via TZ with a negative answer C such that T |= CO. 

PROOF. As the Herbrand universe is finite, the set of possible programs 
P/M is finite. Thus there exists a natural number k such that ^p/m ($) 
is the well founded model of P/Mq for each Mq. Now the Theorem follows 
from Lemma [4. 131 □ 

5 Decidability 

Now we show that the well-founded semantics for hybrid programs is decid- 
able in the case of Datalog, i.e. when the set T of function symbols is a finite 
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set of constants. (No safeness condition is needed.) The proof employs the 
soundness and completeness results from the previous sections. 

Theorem 5.1 (Decidability) Assume that the Herbrand universe is finite, 
and that for any closed constraint C it is decidable whether T \= C. There 
exists an algorithm which, for a hybrid program (P, T) and a ground atom A, 
finds out whether (P, T) |= wf A and whether (P, T) |= wf ->A. 

PROOF. We first show that each maximal t- and tu-tree for program 
(ground(P) , T) can be represented and constructed in a finite way. 

We say that a constraint is in ground solved form if it is built out of the 
constraints of the rules of ground(P) by means of -> and A. The set of such 
constraints up to logical equivalence is finite. (Convert them to a disjunctive 
normal form, remove repeated literals in the conjunctions, remove repeated 
conjunctions.) 

Let B be a ground rule atom, and Tr be a t-tree or a tu-tree for B. Each 
satisfiable equality constraint that appears in the tree is of the form a = a 
(where a e J 7 ), hence it is valid and may be removed. Assume that each 
lower rank (negative) answer employed in the tree is in ground solved form. 
Then each constraint in the tree is in ground solved form. So the set of these 
constraints is finite up to equivalence. 

The set of rule literals that appear in Tr is a subset of the literals of 
ground(P) . Thus the set is finite. So the set of conjunctions of such literals 
is finite, up to logical equivalence. (Repeated occurrences of a literal can be 
removed.) As a result we obtain that the set of nodes of Tr is finite up to 
logical equivalence. 

Assume now that the (negative) answers from lower rank trees employed 
in Tr are known. Then a finite representation of Tr can be constructed 
top-down starting from the root B. Before adding a node G to the current 
(sub-graph of the) tree, it is checked whether a node G' logically equivalent 
to G already exists. If it does and the same literal is selected in both nodes 
then G is not added. This process terminates, its result is a tree Tr 1 which 
is a finite sub-graph of Tr. Each successful leaf of Tr is (logically equivalent 
to) a successful leaf of Tr' . In the terminology of the previous section, the 
disjunction of the successful leaves of Tr' is a finite (pseudo-) answer of Tr 
(and is logically equivalent to each finite (pseudo-) answer of Tr). 

In this way finite representations of the maximal t- and tu-trees for all 
the atoms can be constructed, first for rank and then stepwise for ranks 
1,2, ... . The process is terminated at rank k when for each atom A the 
obtained finite (pseudo-) answers of rank k — 1 and k are equivalent (as it is 
then a finite (pseudo-) answer for A of any rank > k). 
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The well-founded models of P and those of ground(P) are the same; 
hence the programs are equivalent: (P,T) |= wf F iff (ground(P),T) |= wf F 
for any formula F . As ground(P) is finite and safe, the completeness lemma 
(14.13p applies. Thus, for any selection rule 1Z, 

• if (P, T) |= w f A then T \= C, for some finite k > and any maximal 
t-tree of rank > k for A and (ground(P) , T), with a finite answer C, 

• if (P, T) |= w f -iv4 then T |= -lD, for some finite k > and any maximal 
tu-tree of rank > k for A and (ground(P),T), with a finite pseudo- 
answer D. 

Hence, by the soundness theorem (14.91) . (P,T) (= w f A iff T (= C (respec- 
tively (P, T) |= w f -ij4 iff T |= -iD) for the finite answer C (pseudo-answer 
D) for A computed above. Checking whether T |= ->C and T |= -i£) is 
decidable by the assumptions of the theorem. □ 

The decidability result should be compared with the fact that under the 
assumption of Th. 15.11 it is undecidable whether P U T |= A [LR98] . The 
difference is that the logical consequence deals with arbitrary interpretation 
domains, while the semantics of hybrid programs interprets P over the Her- 
brand universe, which in this case is finite. 

6 Related Work 

The notions of external theory and constraints used in hybrid rules are sim- 
ilar to those used in CLP. However, classical CLP does not support non- 
monotonic reasoning. The results presented in this paper can thus also be 
seen as an approach to integration of both paradigms, based on the well- 
founded semantics of normal programs. 

We are aware of few papers on non monotonic negation for CLP. The 
approach of |Fag97| employs the 3-valued completion semantics. It is similar 
to the work presented here; in both cases the operational semantics follows 
the idea of |Dra95j of selecting a cross section of a derivation tree and negating 
the disjunction of respective constraints. The completion semantics is also 
used in |Stu95j . An approach generalizing the well-founded semantics for 
CLP is presented in |DS98] . It however assigns the semantics only to some 
programs (those that can be transformed to an irreducible program), while 
our semantics deals with all programs. In contrast to our approach, the 
operational semantics of [DS98J is not top-down and goal driven. In consists 
of applying program transformations to obtain an irreducible program; the 
latter can be used to obtain answers to goals. 
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The presented framework makes it possible to integrate normal logic pro- 
grams with ontologies expressed as first-order theories, including those spec- 
ified in the web ontology languages OWL-DL and OWL-Lite. 

The problem of integration of rules and ontologies has been addressed 
in different ways. One line of research aims at achieving the integration 
by embedding rules, ontologies and their combinations in a known logic. 
A well-known proposal of this kind is SWRL [HPS04J, extending ontolo- 
gies with Horn formulae within FOL, but not allowing non-monotonic rules. 
More recent attempts [dBET08| IMR071 ldBPPV07] address the issue of non- 
monotonicity by embedding non-monotonic rules and DL ontologies in vari- 
ous logics which make it possible to capture non-monotonicity. 

In contrast to that, we achieve the integration by a direct definition of 
the semantics of hybrid rules. The declarative semantics combines the FOL 
semantics of the external theories with the well-founded semantics of logic 
programs. Negation in the constraints of the external theory is interpreted 
in the classical way, while negation in rule literals is non-monotonic. We now 
compare our work with the approaches to integration of rules and ontolo- 
gies which make similar assumptions. We note first that all related work of 
this kind is based on Datalog rules, while our approach admits non-nullary 
function symbols. 

Our work is strongly motivated by the early ACTog approach [DLNS98J 
where positive Datalog was extended by allowing the concepts of ACC DL as 
constraints in safe Datalog rules. The operational semantics of AC-log relies 
on an extension of SLD-resolution where the disjunction of constraints from 
different derivations is to be submitted for validity check to the DL-reasoner. 
We adopted the ACTog idea of extending rules with constraints in the body, 
and applied it to more expressive rules including non-monotonic negation, 
and to arbitrary external theories of FOL. 

In our approach the heads of the hybrid rules are atoms built with rule 
predicates. Thus the semantics of the rule predicates depends on the external 
theory which is assumed to be given a priori and not to depend on the 
rules. The rationale for that is that the rules describe a specific application 
while the theory (for example an ontology) provides a knowledge common for 
an application domain. In contrast to that, several papers [MSS051 IRos05j 
IRos06| allow the use of ontology predicates in the heads of rules, defining 
thus an integrated language where rule predicates and ontology predicates 
may be mutually dependent, and ontology predicates can be (re-) defined by 
rules. 

The paper [MSS05J defines DL rules, a decidable combination of OWL- 
DL with disjunctive Datalog without non-monotonic negation. In contrast 
to that, our primary concern is non-monotonic reasoning. 
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The r- hybrid knowledge bases [Ros05j and the more recent VC+log |Ros06j 
are based on disjunctive Datalog with non-monotonic negation under the sta- 
ble model semantics. The objective is to define a generic integration scheme 
of this variant of Datalog with an arbitrary Description Logic. The DL-rules 
defined under this scheme may include DL predicates not only in their bodies 
but also in the heads. A hybrid VC+log knowledge base consists of a DL 
knowledge base /C and a set of hybrid rules V . A notion of non-monotonic 
model of such a knowledge base is defined by referring to first-order modelJ^l 
of /C and to the stable models of disjunctive Datalog. This is similar to our 
definition of declarative semantics in that models of K, are used to transform 
the set of grounded hybrid rules into a set of ground Datalog rules, not in- 
cluding DL-atoms. However, as the heads of the hybrid rules may include 
DL-atoms, the transformation is more elaborate than our V/Mq transfor- 
mation. Also the semantics of VC+log is based on stable models of the 
transformed ground rules, while our semantics is based on the well-founded 
semantics of V/M . For stratified normal logic programs the stable model 
semantics is equivalent to the well-founded semantics JAB94] . Thus for strat- 
ified sets of rules VC+log coincides with our approach (provided the rules 
are non disjunctive, and without DL-atoms in their heads, and the external 
theory satisfies the requirements of VC+log). 

The proposed reasoning algorithm (NMSAT-VC+log) works bottom- up 
and is based on grounding. Our operational semantics works top-down and 
does not require grounding. Decidability of VC+log is achieved by a weak 
safeness condition. The condition is similar to that in our approach. However 
here it is not needed for decidability (but for completeness of the operational 
semantics; it is also one of alternative sufficient conditions for soundness). 

The language of Description Logic Programs (dl-programs) |ELST04aJ 
integrates OWL DL with Datalog rules with negation. This is done by al- 
lowing in the bodies so called dl-queries to a given ontology. The queries 
may locally modify the ontology. Two kinds of declarative semantics are 
considered for the integrated language. The semantics of choice extends the 
stable model semantics of Datalog with neg atioi0 [ELST04 a] but an exten- 
sion of the well-founded semantics is also considered |ELST04b] . In both 
variants of the declarative semantics the truth value of a rule w.r.t. to an 
interpretation depends on dl-queries in the rule being logical consequences 



15 However, only a fixed domain of interpretation is considered, with a fixed interpre- 
tation of constants. Moreover, the interpretation is a bijection (each domain element is 
denoted by a distinct constant), and the domain is countably infinite. 

16 More recent versions of this work are based on disjunctive Datalog with negation. 
A further extension is HEX-programs EIST06J, where external atoms are used to model 
interface with arbitrary external computations. 
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of the respective ontologies. This makes the semantics incompatible with 
the standard semantics of the first order logic. For example consider rules 
P = {p <— Qi; p <— Q2 }, where none of DL-formulae Q\,Q2 is a logical 
consequence of the ontology T, but in each model of T at least one of them 
is true. Then p is a logical consequence of P U T, but will not follow from 
(P, T) represented as a dl-program. In contrast to that, our approach is com- 
patible with FOL, in the sense explained in Section I3~2l In the last example, 
p follows from the hybrid program (P, T). 

7 Conclusions 

We presented a framework for integration of normal logic programs under the 
well-founded semantics with first-order theories. Syntactically the integration 
is achieved by extending the bodies of normal clauses with (certain) formulae 
of a given external theory, resulting in the notions of hybrid rule and hybrid 
program. It is assumed that the theories are external sources of knowledge 
and are not modified during the integration. Therefore it is required that the 
predicates of the extended normal program and the predicates of the external 
theory are distinct. 

The main contributions of this work are: 

• The declarative semantics of hybrid programs, combining the (3- valued) 
well-founded semantics of normal programs with the (2- valued) logical 
semantics of the external theory. It combines non-monotonic negation 
of the well-founded semantics with the classical negation of FOL. It 
allows non constant function symbols. It contrast to most of related 
approaches it is defined for arbitrary external FOL theories. 

The declarative semantics is undecidable, however it is decidable for 
Datalog hybrid programs with decidable external theories. In the spe- 
cial case of a hybrid program (P, T) where V does not include negation, 
the declarative semantics is compatible with the semantics of FOL, in 
the sense explained in Section 13.21 The semantics makes possible rea- 
soning by cases (cf. Ex. 12. 3j) . 

• The operational semantics that describes how to answer (not necessar- 
ily ground) conjunctive queries. This includes handling of non-ground 
negative queries by combination of the ideas of CLP [MSW06J and con- 
structive negation of |Dra95j . The operational semantics can be seen 
as an extension of SLS-resolution with handling of non-ground negative 
queries and constraints of the external theory. It can be implemented 
by compilation to Prolog [DHM07bJ, and employing an LP reasoner 
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for the well-founded semantics and a reasoner for the external theory. 
So the implementation requires rather small efforts thanks to re-using 
the existing reasoners. A prototype implementation of this kind is de- 
scribed in |DHM07b]. It allows non constant function symbols, the 
external theories are OWL ontologies. It uses XSB Prolog [SSW+071 
as an LP engine, and can use various OWL reasoners. It is efficient, in 
the sense that the number of queries to the OWL reasoner is small. 

• Soundness and completeness results: the operational semantics was 
shown to be sound w.r.t. the declarative semantics. It is complete (and 
independent from the selection rule) for safe hybrid programs where 
the function symbols are constants. 

The external theory can be a theory of a constraint domain of CLP. 
Thus our framework additionally provides a method of adding non-monotonic 
negation to CLP, in a way generalizing the well-founded semantics. 

It contrast to most of related work, our approach works for arbitrary 
external FOL theories. There are no restrictions on the alphabet of function 
symbols (it may be finite or infinite). We do not impose any conditions on 
the equality in the external theories, like unique name assumption (UNA) 
or freeness axioms (CET). There are certain requirements related to the 
operational semantics; however we show how an arbitrary external theory T 
can be extended to a theory T satisfying the requirements. 

This paper does not study implementation techniques. They are subject 
of future work, begun in [DHM07b] . 
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Appendix 

Here we present proofs of Lemma 14.71 and of two main technical lemmas 
from Sections 14.31 and 14.41 By var(F) we denote the set of free variables of 
a formula F. 

Proof of Lemma 14.71 We first prove part 2 by induction. Then part 1 
follows immediately. Assume that a node Gi is safe apart of V. We show 
that each its child is safe apart of V. 
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Let Gi = C, L, p(t), L' and its child G i+ i = t—u, C, C, L, K, L' be derived 
from Gi by a rule R = p(u) <— C, K. Let N be the negative literals of L, L', 
and N' be the negative literals of K. Each free variable of u, C, N', is bound 
in C to a ground term or to a variable occurring in a positive literal of K (as 
R is safe). Each variable from var(C, N) \ V is bound in C to a ground term 
or to a variable occurring in a positive literal of L, L' or in t (as Gj is safe 
apart of V). Hence each variable from var(C, N,t=u,C , N') \ V is bound 
in C, t—u, C to a ground term or to a variable occurring in a positive literal 
of K, L, I 7 . _ _ 

Let Gi = C, L, -i A, L' with a negative literal —>A selected, with a child 
G i+1 = C,C',L,T7. We have var(C) C mr(C,A), thus wr(C,C) \ V C 
var(C, A) \ V. Thus each variable from t>ar(C, C") \ V is bound in C, C to 
a ground term or to a variable occurring in a positive literal of L, L' . As the 
negative literals of Gj+i are those of Gj, we obtain that Gj+i is safe apart of 
7. □ 

The proofs below refer to ground programs of the form P/M Q (cf. Def. 13.21) . 
and the operator defining the well-founded semantics of normal logic pro- 
grams (cf. Section I2TTI) : 

*P/Mo(I) = {M {P /Mo)/ t I n^)U ->(H \ M iP/ Mo)/ tu l) 

Remember that from monotonicity of ^p/m it follows that k' < k implies 

Proof of Soundness Lemma 14.81 By transfinite induction on the rank of 
the t-tree and tu-tree. Assume that the lemma holds for t-trees and tu-trees 
of rank < k. We can assume that 9 binds only the free variables of G. 

1. Assume that C is an answer of a t-tree for G of rank k, and that 
Mo |= C9. Consider the branch G\, . . . ,G m of the tree from the root G 
to the leaf C (i.e. G x = G,G m = C) such that C = (. . . V C V . . .)\ G and 
Mo |= (C'\ G )6. We show that some ground instance of CO is true in Mo. This 
is obvious when M is a Herbrand interpretation or when the set of constraints 
has the witness property. If P is safe then C is safe apart from the set 
Vq = var(G), by Lemma 14771 So each variable x G var(C')\V is bound in C 
to a ground term t x . Consider a substitution ip = {x/t x | x e var{C') \ Vq }. 
Formula C'\ G is equivalent to (C'<p)\ G and to C'tp. Hence M |= C'ip9. The 
latter formula is a ground instance of C"0, as ip8 = dip. 

Thus substitution 6 can be extended to a substitution 6 1 ' such that M |= 
C"^', and GO' = GO. Notice that 0' is applicable to all the goals in the 
branch (as for any such goal its constraint is of the form C\, . . . , Ci, where 
C' = C u ...,C n , l<n). 
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The sequence GiO', . . . , G m 0' is a successful branch of an instance of the 
considered t-tree for G. If a positive literal is selected in Gi then G i+ i9' is 
derived from GiO by a rule from (P/Mq). Whenever a negative literal —>A is 
selected in Gi and Cj is the constraint of Gi then the constraint of Gj+i is 
Cj, C", where C" is a negative answer of a tu-tree of rank j < k for Cj, A. 
As 6*' is applicable to Gj + i, we have M |= (Ci,C")6'. By the inductive 
assumption, ^^^ o (0) (=3 ~~<A6'. The same holds for any k' > j. 

Thus the sequence GiO', . . . , G m 9' is a successful SLD- derivation for GO' 
and the program P t = (P/M o )/^p / H A ^ o (0) for some fc' < k. (If fc = then P t = 
P/Mq.) Hence M.p t \= G9, by soundness of SLD-resolution |Apt97| . By the 
definition of ^ p/m i f°r eacn positive literal A' of GO we have ^ P/ ^ o (0) |=3 A'. 
For each negative literal ->A' of G6» we have ->A' G P t , i.e. * P '/^ O (0) |=3 ->A'. 
By monotonicity of ^p/m for each literal L of GO we have ^p~/j^ o (0) |=3 L. 
Hence ^(0) N GO. 

2. Consider a tu-tree Tr of rank for G. Let the negative answer C be 
obtained from Tr and its cross-section F. Let CF be the constraints of the 
nodes of F. So C implies -| (Cj| G ) for each Cj G CF. Assume that M |= C#. 
Then M \f Cft ', for any extension 0' of grounding the variables of Cj 
and for each Cj G CF. Thus, in a ground instance TrO of the tree, no node 
corresponds to a node of F (nor to a descendant of a node of F). Hence no 
leaf of TrO is successful. 

Out of TrO we construct an SLD-tree with root GO for a certain ground 
program. The tree has no success leaves, hence GO is false in the least 
Herbrand model of the program (by completeness of SLD-resolution). 

In TrO, consider a leaf G'O' corresponding to a node G 1 = (C, L, -iA, U) 
of Tr, with a negative literal —>A selected. The node G' does not have a 
child C',L,L' (such a child implies that G'O' is not a leaf). Hence case 



4(b)ii or 4(b)iii has been applied to the node. Thus there exists an answer 



C" of a t-tree for C',A of rank k' < k. Moreover (^C",C')0' is false in 
M (otherwise ^C",C is satisfiable, G' has a child -.C", C, Z, 77, and G'0' 
has a child LO' JJO'). Hence ^C"#' is false in M (as CO' is true in M ). 
From Mo |= C"^' and from the inductive assumption we obtain $ pt^ (0) |=3 
AO' . By monotonicity of *&p/m the same holds for any k" > k'. Thus 
-nM' (P/Mo)/ te * P '+J o (0), by the definition of As fc' + 1 < k, we 
have ->A0' £ (P/M o )/ to * P/Mo (0). Hence the node G'^' (with -iA6*' selected) 
is a leaf of an SLD-tree for program (P/M o )/t u \l/p / / Mo (0). (Notice that this 
reasoning is also valid for a non finite k. In particular, k! + 1 < k if k is a 
limit ordinal.) 

If a non-leaf node G'6 1 ' in TrO corresponds to G' = (C',L,^A G >,L') of 
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Tr, with -iAqi selected, then the child of G'9' is (L, L')6'. For each such 
node G'9' of Tr9, let us add ^A G i9' to the child and all its descendants. The 
obtained tree is an SLD-tree for G9 and a program P tu U P V oid, where Pt u = 
(P/M o )/t u ^/py Mo (0), and each clause of P VO id is of the form ->A <— ->A. Adding 
-Pvoid to a definite clause logic program does not change its least Herbrand 
model. The obtained tree does not have a success node. By completeness of 
SLD-resolution |Apt97| A4p tu \f G9, and for some literal L of G9 we have 
L£M Ptu . 

If L is of the form ->A then A G ^p/ Mo (0) (by the definition of / tu ), and 
L is f in * F/Mo (0). Otherwise LeH\M Ptu and ->L G *p/i o (0), hence L is 
f in *{,)i o (0). In both cases L is f in ^ P )] /f) (0), thus ^^(0) hs □ 

Proof of Completeness Lemma H7T51 If # P/Mo (0) hs ^ or # P/Mo (0) 
|=3 -1G6 1 then we say that k — 1 is a level of G# when G does not contain 
a negative literal and k is a level of G9 when G contains a negative literal. 
The proof is by induction on a level of G9; the rank of the constructed t- or 
tu-tree for G is the level of G9. 

Let G9 be of level k' G {k - 1, Let us denote / = ^ p ~/ 1 Mq ($) and 

J = \I/p'/ A/o (0). So J = \E'p/Af (/) if G contains a negative literal, and J = I 
otherwise. In both cases I C J. 

1. Assume ^ r P /jvf o (0) |=3 C^. Each positive literal A in G9 is a mem- 
ber of M.(p/M )/ t i (by the definition of and hence of M.(p/M )/ t J- F° r 
each negative literal ->A in we have J [=3 -iA (as then ^p/ Mo ($) = J)- 
Thus -iA G (P/M )/ t J, by the definition of / t . So, by completeness of SLD- 
resolution, for any selection rule there exists a successful SLD-derivation T> 
for the goal G9 and program (P/M )/ t J. 

We now show how negative literals in the derivation are related to lower 
rank tu-trees. Consider a negative literal ->A occurring in T>. We have 
^A G (P/M )/ t J (i.e. ^A G J), so A is of level k' - 1. By the inductive 
assumption, for any goal B such that A = B9' is a ground instance of B, if 
C is the negation of a finite pseudo-answer of a maximal tu-tree for B via 
K of rank jfe' - 1 then M (= C"0'. 

Consider a maximal t-tree Tr of rank k' for G and (P, T) via Let Tr9 
be a ground instance of Tr. It can be seen as an SLD-tree via a selection rule 
1Z', for a certain ground definite program containing the rules of P/M and 
some facts of the form ->A. For 1Z' there exists a derivation X> as above. We 
show that D is a branch of Tr9. We prove by induction on i that the i-th goal 
of P is a node of Tr9. Let a goal of T 5 be a node of Tr9, corresponding 
to a node G' of Tr. If a positive literal is selected in G'9' and in G' then 
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any goal derived from CO' by a rule from P/M is a ground instance G"Q" 
of a child G" of G' in Tr, where 0" is an extension of 0'. If a negative literal 
-u4 is selected in G'9' then a -if? is selected in G 1 and -i^4 = ->B0'. The 
tree Tr refers to a maximal tu-tree for B, and to the negation C of its finite 
pseudo-answer. As explained in the previous paragraph, Mq \= C'9' follows 
from the inductive assumption. The child G" of G' in Tr is G' with -<Bi 
removed and C added. So substitution 9' is applicable to G", and G"9' is 
G'9' with ->Ai removed. Thus the successor of G'9' in T> is the child G"9' of 
G'0' in Tr9. 

The last goal of T> is empty, it is a ground instance of a leaf G s of Tr. 
Thus M \= (C S 9)\ G . Let C be a finite answer of the t-tree. M |= CO. 

2. Assume that \I/p , Mq (0) |=3 -iG0. Some literal L of G# is false in 
*p/a/ o ( ) = *p/m o 0O- If ^ = ^4 is an atom then ->A G ®p/m (I) Q ®p/m (J)- 
Hence A G" M.(p/M )/ tu j (by the definition of ^p/m )- If L = ->A is a negative 
literal then A G ^ P / Mo {I) = J- Hence ~^A G" (P/M )/ tu J (by the definition of 
/t u ). In both cases, L G" •M.(p/M )/ tu j- Thus, by soundness of SLD-resolution, 
there does not exist a successful SLD-derivation for GO and the program 
(P/M )/ tu J (as otherwise L G •M(p/M )/t u Jj contradiction). 

Consider a maximal tu-tree Tr of rank k' for G via 7?-, with a pseudo- 
answer D. We have to show that M \= ->D0. Assume the contrary; then Tr 
has a success leaf C s such that Mq \= (C S \ G )0. Consider a branch of Tr from 
G to C s . Let Gi, . . . , Gi be the goals of the branch (G = G\, Gi = C s ). As 
P is safe, by Lemma 14.71 there exists a substitution 0' which is an extension 
of and is applicable to the nodes of the branch. (See part 1 of the proof 
of Lemma H3] for details.) We show that GiO', . . . , G[0' is a successful SLD- 
derivation for GO and (P/M )/ tu J, which is a contradiction. 

If a positive literal is selected in Gi and a rule r G P is applied to Gj to 
obtain Gj + i then Gi + \0' is obtained from Gj0' by applying a normal ground 
instance ra G (P/M )/ tu J of r. Assume a negative literal -<S is selected in 
Gj. The constraint of Gj + i is equivalent to Gj, -iG" where Gj is the constraint 
of Gj and C" is a finite answer of a maximal t-tree of rank k' — 1 for B. We 
have M |= ~^C"0', as 6 1 ' is applicable to Gj + i. Suppose -B6 1 ' G J. This leads 
to contradiction: such BO' is of level k' — 1 and, by the inductive assumption, 
the answer C" satisfies M |= C"0'. Thus J and ^B0' G (P/M )/ tu J. 
Thus G i+ iO' is obtained from Gj^' by applying a rule from (P/M )/ tu J. 

Hence GiO', . . . , G^' is a successful SLD-derivation for GO and (P/M ) / tu J; 
the assumption that M |= (Gslg,)^ is false. So M |= -^(C S \ G )0 for each suc- 
cess leaf G s of Tr. Let D be a finite pseudo-answer of Tr. Then Mo |= ->D0 
for a negative answer ->D of the tree. □ 
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